Fault-tolerance threshold for a distance-three quantum code 
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The quantum error threshold is the highest (model-dependent) noise rate which we can toler- 
ate and still quantum-compute to arbitrary accuracy. Although noise thresholds are frequently 
estimated for the Steane seven-qubit, distance-three quantum code, there has been no proof that 
a constant threshold even exists for distance-three codes. We prove the existence of a constant 
threshold. The proven threshold is well below estimates, based on simulations and analytic models, 
of the true threshold, but at least it is now known to be positive. 
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PACS numbers: 03.67.Lx, 03.67.Pp 
I. INTRODUCTION 

Quantum operations are inherently noisy, so the devel- 
opment of fault-tolerance techniques is an essential part 
of progress toward a quantum computer. A quantum 
circuit with TV gates can only a priori tolerate 0{1/N) 
error per gate. In 1996, Shor showed how to tolerate 
O(l/poly(log A^)) error by encoding each qubit into a 
poly(log A^)-sized quantum error-correcting code, then 
implementing each gate of the desired quantum circuit 
directly on the encoded qubits, alternating computation 
and error-correction steps [1]. Even though the correc- 
tions themselves are imperfect, noise overall remains un- 
der control - the scheme is "fault-tolerant." 

Several groups [2-5] independently realized that by 
instead using a constant-sized quantum error-correcting 
code repeatedly concatenated on top of itself - and cor- 
recting lower levels more frequently than higher levels 
- a constant amount of error is tolerable, again with 
only polylogarithmic overhead. The tolerable noise rate, 
which Aharonov and Ben-Or proved to be positive [2], 
is known as the fault-tolerance threshold. Intuitively, 
small, constant-sized codes can be more efficient to use 
because encoding into the quantum code (which is nec- 
essary at the beginning of the computation and also dur- 
ing error correction, in certain schemes) is a threshold 
bottleneck. However, the threshold proof of Aharonov 
and Ben-Or only applies for concatenating codes of dis- 
tance five or higher. In this paper, we prove a con- 
stant noise threshold for the concatenated distance-three, 
seven-qubit Steane/Hamming code. (A threshold for the 
distance-three five-qubit code follows by the same struc- 
ture of arguments.) 

The attainable threshold value, and the overhead re- 
quired to attain it, are together of considerable exper- 
imental interest. Thus, while work has continued on 
proving the existence of constant thresholds in differ- 
ent settings - e.g., under physical locality constraints [6], 
or with non-Markovian noise [7] - a substantial amount 
of attention has been devoted to estimating the fault- 
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tolerance threshold, using simulations and analytic mod- 
eling. Most of these threshold estimates have used the 
seven-qubit code, from basic estimates [2, 8-10], to esti- 
mates using optimized fault-tolerance schemes [11-13], to 
a threshold estimate with a two-dimensional locality con- 
straint [14]. One reason the seven-qubit code has been so 
popular is no doubt its elegant simplicity, and its small 
size allows for easy, efficient simulations. However, there 
had been no proof that a threshold even existed for the 
simulated fault-tolerance schemes. 

Currently, the highest error threshold estimate is due 
to Knill, who has estimated a threshold perhaps as high 
as 5% by using a very efficient distance-two code with a 
novel fault-tolerance scheme [16]. Being of distance two, 
the code only allows for error detection, not correction, 
so the scheme uses extensive rejection testing. This leads 
to an enormous overhead at high error rates, limiting 
the practicality of operating a quantum computer in this 
regime. Still, a major open problem remains to prove the 
existence of a threshold for a distance-two code. 

We prove an error threshold lower bound of 6.75 x 10^^ 
in a certain error model. Our analysis is prioritized for 
proof simplicity and ease of presentation, not for a high 
threshold (although we discuss optimizations in Sec. IV). 
Also, it is not surprising that unproven threshold esti- 
mates should be significantly higher than proven thresh- 
old lower bounds - although actually the author is un- 
aware of any published rigorous lower bounds besides the 
current work and Ref. [17] (except in the erasure error 
model [18]). But such a large gap between what we can 
prove, and what our models and simulations indicate is 
embarrassing. A second major open problem is to close 
the gap between proofs and estimates. 

Our proof is based on giving a recursive characteriza- 
tion of the probability distribution of errors in blocks of 
the concatenated code. Intuitively, with a distance-three 
code, two errors in a code block is a bad event, so the 
block error rate should drop roughly like cp^ , with p the 
bit error rate. After two levels of concatenation, the error 
rate should be like c(cp^)^ = (cp)^ /c, and so on[25]. The 
threshold for improvement is at p = 1/c. The difficulty 
lies in formalizing and making rigorous this intuition. 

The classic threshold proof of Aharonov and Ben-Or 
[2] can be reformulated to rely on a key definition of 1- 
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goodness. Roughly, define a code block to be 1-good 
if it has at most one subblock which is not itself 1- 
good. Maintaining this definition as an inductive invari- 
ant through the logical circuit ~ i.e., proving that the 
outputs of a logical gate are 1-good if the inputs are, 
with high probability - allows provable thresholds for 
concatenating codes of distance five or higher. But this 
definition does not suffice for concatenating a distance- 
three code. For take a 1-good block, with the allowed 
one erroneous subblock, and apply a logical gate to it. If 
a single subblock failure occurs while applying the logi- 
cal gate, there can be two bad subblocks total, enough 
to flip the state of the whole block (since the distance 
is only three). Therefore, the block failure rate is first- 
order in the subblock failure rate. Logical behavior is not 
necessarily improved by encoding, and the basic premise 
of fault tolerance, controlling errors even with imperfect 
controls, is violated. 

Essentially, a stronger inductive assumption is required 
for the proof to go through. With 1-goodness, we are as- 
suming that the block entering a computation step has no 
more than one bad subblock. Intuitively, though, most 
of the time there should be no bad subblocks at all. We 
capture this intuition in the stronger definition of "1- 
wellness." In a 1-well block, not only is there at most 
one bad subblock, but also the probability of a bad sub- 
block is small. With this definition, the problem sketched 
above does not occur because the probability of there be- 
ing an erroneous subblock in the input is already first- 
order, so a logical failure is still a second-order event. 
For the argument to go through, though, the definition 
must be carefully stated, and we need to carefully define 
what is required by each logical gate and how the logical 
gates will be implemented. Controlling the probability 
distribution of errors is the main technical tool and new 
contribution of this paper. 

Very recently, Aliferis, Gottesman, and Preskill inde- 
pendently completed a threshold proof for distance-three 
codes [17], based instead on formalizing the "overlapping 
steps" threshold argument of Knill, Lafiamme and Zurek 
[8]. Our probabilistic definitions may be more difficult 
to extend to different error models. However, the prob- 
abilistic structure is also a potential strength, in that 
it may make this proof more extensible toward provable 
thresholds for postselection-based, error-detection fault- 
tolerance schemes like that of Knill [16]. 

The intuition that small codes perform better needs 
to be qualified. Small codes perform well because they 
can be quickly encoded and so allow frequent, rapid er- 
ror correction at the lowest level. However, larger codes 
offer protection with higher distances. There is therefore 
potentially a tradeoff. Steane finds that at noise rates 
well below the threshold it is best to start with a small 
code at the lowest concatenation level, then switch to a 
larger code [15]. Also, if measurements are slow, then 
small codes can no longer perform rapid error correction, 
because they are waiting for measurements to complete. 
In this case, Steane estimates that, from among a large 



set of codes, the seven-qubit code comes in only third be- 
hind the 23-qubit Golay code (distance 7) and a 47-qubit 
quadratic residue code (distance 11), which offer better 
efficiency compromises. 

Sections II and III contain the necessary definitions, 
and the proof of a threshold for quantum stabilizer oper- 
ations (meaning preparation of fresh qubits as [0), mea- 
surement in the computational basis, and application of 
Clifford group unitaries like the CNOT gate). Stabilizer 
operations are easy to work with because Pauli errors 
(bit flips and dual phase flips) propagate through lin- 
early. While encoding and error-correction only require 
stabilizer operations, stabilizer operations do not form 
a universal gate set - in fact, a circuit consisting only 
of quantum stabilizer operations can be efficiently classi- 
cally simulated [19]. Section V extends the proof to give a 
threshold for full universal quantum computation, using 
the technique of "magic states distillation" [16, 20, 21]. 
In fact, the threshold itself is unaffected by this exten- 
sion ~ the bottleneck in our threshold proof, as in most 
threshold estimates, is in achieving stabilizer operation 
fault-tolerance. 



II. DEFINITIONS 
A. Concatenated Steane code 

In concatenated coding, qubits are arranged into level- 
one blocks of n, which are in turn arranged into level- two 
blocks of n, and so on. We call a single qubit a blockp, n 
grouped qubits a blocki , and n'^ grouped qubits a blocks 
(but often extraneous subscripts will be omitted). 

We will use the distance-three Steane code on n = 7 
qubits. Recall its stabilizer generators 

inzzzz,izznzz, ziziziz, 

IIIXXXX, IXXIIXX, XIXIXIX, 

Here, X = (lo) is a bit flip operator ~ X\Q) — 
|1),A:|1) = |0) - while Z = is a phase flip oper- 

ator. The flrst three stabilizer generators are equivalent 
to the classical [7, 4, 3] Hamming code's parity checks, 
while the last three stabilizers are the same but in a dual 
basis. (As is customary, tensor signs are omitted for leg- 
ibility.) With this code, encoded, or logical, X and Z 
operators are transverse X and Z operators, respectively: 
i.e., Xl = X®\ Zl = Z®\ Other logical Chfford group 
operations are also performed by transverse physical Clif- 
ford group operations. For example, the CNOT gate is 
defined by CNOT|a, 6) = [a, a ©6) for a,6e {0,1}. Log- 
ical CNOT is just transverse CNOT. Steane code encod- 
ing circuits are given in Fig. 1. 

B. Error model 

Definition 1 (Base error model). Assume each 
CNOT gate fails with probability p, independently of 
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FIG. 1: Circuit C (left) encodes the ancilla |0)i using nine 
CNOT gates. To encode an arbitrary state requires C 
and two more CNOT gates (right). 



the others and earlier or simultaneous measurement out- 
comes, resulting in one of the sixteen Pauli products 
I®I,I^X,...,Z^Z being applied to the involved 
qubits after a perfect CNOT gate. Assume that single- 
qubit Clifford group operations are perfect, that single- 
qubit preparation and measurement is perfect, and that 
there is no memory error. 

For ease of exposition, we consider a very simple er- 
ror model, defined to start only for stabilizer operations. 
Several of these assumptions are not essential. There be- 
ing no memory error is essential only as it implies arbi- 
trary control parallelism, which is essential in threshold 
schemes. The independence assumption of the CNOT 
failures can be relaxed as long as the as the conditional 
probability of failure (regardless of earlier or simultane- 
ous events) remains at most p. The probabilistic nature 
of the failures is however essential for the proof in its cur- 
rent form. Probabilistic failures of other operations can 
be straightforwardly incorporated. 

As is standard, additionally assume perfect classical 
control with feedback based on measurements. All the 
required classical computations are efficient. It is often 
assumed that classical computations are instantaneous, 
although this assumption doesn't matter with no memory 
error and even in general it can carefully be removed. The 
use of feedback is important, particularly for the scheme 
we use to gain universality. 



C. Error states 

X, Z or Y = iXZ errors are tracked from their in- 
troduction through the circuit by the commutation rules 
of Fig. 2. Some errors may have trivial effect - e.g., 
Z\0) = |0) " but we still record them; we do not reduce 
errors modulo the stabilizer. When we later extend the 
proof to a universal gate set, top-level logical Pauli er- 
rors can no longer be traced through the circuit, but we 
will make certain that these errors happen with vanishing 
probability. 

Since the code has distance three, error-free decoding 



FIG. 2: Propagation of Pauli errors through a CNOT gate; 
X errors are copied forward and Z errors copied backward. 



of a blocki is well-defined. Error- free (perfect), bottom- 
up decoding of a blocks is defined recursively by first de- 
coding its subblocksfc-i, to interpret their statesfe_i, then 
decoding the block. Note that this recursive procedure is 
not the same as correcting to the closest codeword, but 
it is easier to analyze. 

Definition 2 (State). The statco of a qubit is cither I, 
X, Y, or Z, depending on what we have tracked onto that 
bit. The statCk of a blockk is I, X, Y, or Z, determined 
by error-free decoding of the statesk-i of its subblocks. 

The state of a block is determined by the states of its 
subblocks. We want to define the relative states of the 
subblocks, because a probability distribution over sub- 
block errors is most naturally defined keeping in mind 
(i.e., relative to) the state of the enclosing block. If a 
block is in error, then necessarily some of its subblocks 
will be in error. 

As a simple example, consider the classical three-bit 
repetition code: Ol = 000, 1l = HI. If the states of three 
bits are XII - the first bit is in error - then the block's 
state decodes to be I. The first bit is also in relative error. 
If the states of three bits are IXX, then the block's state 
decodes to be X. The first bit is said to be in relative 
error (although it is not in error). Making this definition 
precise, particularly in the quantum case, requires some 
care because different errors can be equivalent. 
Definition 3 (Relative syndromefe). The relative 
syndrome^ of a blocks consist of the syndromes of the 
n — 1 code stabilizer generators on the stateSk^i of the 
subblocksk-i. 

Definition 4 (Relative statefc_i). The relative 
statesk-i of subblocksk-i of a blocks are given by the 
minimum weight error, counting Y errors as two, gener- 
ating the relative syndromck of the blockk . 

There is a unique minimum weight error, so this no- 
tion is well-defined, since every error syndrome can be 
achieved with at most one X and one Z error (or with 
one Y error). Note that according to this definition, rel- 
ative statesfc_i are intuitively "relative" to the state^, 
and are not recursively related to the relative state^. 

For example if the first two subblocks are in X error, 
then the block's state is X error, with the third subblock 
in relative X error (since logical X is equivalent to XXXI- 
III). Here is another example with a statcfe of X: 



statesfc_i 
relative statesfc_i 



I IIIIYX 
XIIIIZ I . 



Unlike its state, the (X or Z component of the) rela- 
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tive state of a subblock can be determined by measur- 
ing the block transversely (in the Z or X eigenbases). 
Again using the three-qubit repetition code for an ex- 
ample, = 75(1000) + |111)). Since {XII)\+)l = 

{IXX)\+)l oc |100) -I- 1011), one can't measure the state 
of the first qubit. However, measuring in the 0/1 compu- 
tational basis (Z eigenbasis) gives 100 or Oil with equal 
probabilities, telling us in cither case that qubit 1 was in 
relative error (before the destructive measurement). 

D. Logical error model 

Definition 5 (Logical error model). The implemen- 
tation of a logical operation Uk on one or more block,% is 
said to have had the correct logical effect if the following 
diagram commutes: 

Uk 



U 

Here the vertical arrows indicate perfect recursive decod- 
ing of the involved blocks, and the lower horizontal arrow 
represents a perfect U on the decoded blocks. 

Uk has had an incorrect logical effect if the same di- 
agram commutes but with P o U on the bottom arrow, 
where P is a Pauli operator or Pauli product on the in- 
volved blocks. 

In our error model, with our implementations, every 
logical operation will have cither the correct logical ef- 
fect or an incorrect logical effect with some P probabilis- 
tically. For example, in error correction of a block, U 
is the identity. Error correction has the correct logical 
effect (no logical effect) if the state of the system follow- 
ing a perfect recursive decoding on the corrected blocks 
is the same as if we had just perfectly decoded the in- 
put blocks (and then applied the identity). In particular, 
this implies that the states of the output is the same as 
the statcfc of the input, but, more than that, also logical 
entanglement is preserved. 

E. Goodness and wellness 

A key problem in proving a threshold is in establishing 
the proper definitions for inductively controlling the er- 
rors. Once the correct definitions have been stated care- 
fully, proving the relationships among them needed for a 
threshold result is fairly straightforward. 

The classic proof of a threshold in this setting, due to 
Aharonov and Ben-Or [2], can be framed as relying on 
the definition: 



Definition 6 (r-goodfe). A blockk is r-goodk (and not 
r-badk) if it has < r subblocksk-i which are either in 
relative error or not r-goodk-i- A blocks (single qubit) 
is r-goodo if it is not in relative error. 

So in a 1-good block, we have control over errors in 
n — 1 of the subblocks (they are 1-good themselves and 
not in relative error), but potentially no control over the 
state of one of the subblocks. 

Definition 6 does not suffice for proving a threshold for 
a distance-three code because there is no room for errors 
in blocks which interact. We can't maintain the inductive 
assumption of each block being 1-good because as soon 
as two blocks interact, they will then each have two sub- 
blocks with uncontrolled errors (with a priori constant 
probability, not second-order probability as we desire). 
Aliferis ct al. manage to use a similar definition, but 
change the method of induction proof to involve "over- 
lapping steps." We instead will use a similar inductive 
proof to Aharonov and Ben-Or, but with different defi- 
nitions to look at probability distributions of errors. 

To give a threshold argument with a d = 3 code, we will 
use a definition for probability distributions over relative 
errors. 

Definition 7 (v^rellfc). A blockk is wcllk(pi, ■ . ■ ,Pk) if, 
conditioned on its state and on the errors in all other 
blocks, it has at most one subblock cither in relative error 
or not wellk-i{pi, • . • ,Pk-i), and 

P{such an uncontrolled subblock) < pk. 

This definition conditions on the errors in all other 
blocks as a measure of independence. Note that we don't 
assert anything about the distribution of errors within 
a subblock in relative error. This is important because 
errors within relatively erroneous subblocks are typically 
less well controlled. (For example, consider a perfect 7- 
qubit blocki, and introduce bitwise independent errors. 
When the block as a whole is in error, one relative bit 
error is more likely than none, since two bit errors are 
more likely than three.) 

For example, again using the three-qubit repetition 
code, the ensemble 

/// w/ prob. 1 — p 
IXX w/ prob. p 

is not welli(p). Even though the probability of a relative 
error is < p, conditioning on a logical state of X there is 
a relative error with probability one. 

Definition 7 can be generalized to r-wellfc, a require- 
ment on probability distributions with up to r relative 
errors, but for a concatenated distance-three code thresh- 
old, r = 1 is sufficient and so I have omitted any prefix. 

III. FAULT-TOLERANCE FOR STABILIZER 
OPERATIONS 

Our proof of fault-tolerance for stabilizer operations 
will rely on three indexed Claims A^, B^ and for, re- 
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spectively, encoded ancilla preparation, error correction 
and encoded CNOT, at code-concatenation level k, with 
the following inductive dependencies: 



A fc - 1 k 




C fc- 1 — - fc 



That is, as shown in Fig. 3, a level-fc encoded CNOT, or 
CNOTfc, will use CNOTsfc_i and error corrections^ - so 
the proof of Claim Ck will rely on Claims Gk-i and B^. 
Error correction^, will use correctionSfe_i and CNOTs;;_i, 
as well as ancillasfe. Finally, the proof of Claim A^. 
(ancillafc preparation) will rely on each of Claims Afe_i, 
Bfe_i and C/c-i- 

Each levcl-fc operation will fail with probability Ak, Bk 
or Ck (failure parameters are italicized unlike the names 
of the claims to which they correspond). Failure parame- 
ters will drop quadratically at each level, giving a thresh- 
old as sketched in Sect. I. That is, 

max{Ak,Bk,Ck} = 0{{ma.x{Ak-i, Bk-i,Ck-i}f) ■ 

Splitting out separate error parameters in this way lets 
us easily track where errors are coming from, and lets 
us find the threshold bottlenecks for optimization. We 
will also define two wellness parameters Uk and bk (since 
a CNOTfe ends with error correctionsfc, there is no need 
for a separate wellness parameter Ck). 

Relaxing some of the assumptions in our error model 
would just require modifying these claims, and possibly 
adding new ones. For example, we have assumed perfect 
measurements, but faulty measurements would only re- 
quire a fourth indexed claim dependent only on itself (a 
level-fc measurement outcome is the decoding of n level- 
(fc — 1) measurement outcomes). 

Preparation of a single-qubit ancilla in state |0)o = |0) 
or |-f)o = = "^(|0) + |1)) assume to be perfect, 
so define A^ = 0. (Alternatively, set > to remove 
this assumption.) For fc > 1, we need: 

Claim Afc (Ancilla^). Except with failure probability at 
most Ak, we can prepare a level fc ancilla \0)k which has 
a statck of I (no error) and is wellk (bi, . . . ,bk-i, cik). 

(The different parameters will be set explicitly within the 
proofs.) 

Error correction^ is only defined for fc > 1 levels of 
encoding, so we may take Bq = 0. 

Claim Bfe (Correction^). With probability at least 1 — 
Bk, the output is wellk(bi, . . . ,bk) and, if the input is 
wellk(bi, . . . ,bk), there is no logical effect. 

Additionally, if all but one of the input subblocksk-i 
are wellk-i (bi, . . . , bk-i) and not in relative error, then 
with probability at least 1 — _B|,, there is no logical effect 
and the output is wellk (bi, . . . ,bk). 



Note how powerful a successful correction^ is. Even if 
there is no control whatsoever on the errors in the input 
block, the output is still wellfc. This property is essen- 
tial for getting errors fixed in a fully recursive manner, 
because it means that to fix an erroneous subblockfc_i, 
we need only apply a single-qubit correction transversely 
on that subblock and there is no need to worry about bit 
errors within that subblock. 

CNOTo is simply a physical CNOT gate. Cq = p is 
the probability of failure of a physical CNOT gate (for 
the oft-used simultaneous depolarization error model, p 
is yI times the depolarization rate). 

Claim Ck (CNOT^). With probability at least 1 - Ck, 
the output blocksk are wellk (bi, . . . ,bk) and, if the input 
blocksk are wellk (bi, ... ,bk), then a logical CNOT, the 
correct logical effect, is applied. 

Proof of Claim Ck ■ CNOT^, is implemented as trans- 
verse CNOTfc_i, followed by Correctioufe on each block. 

Always declare failure if either Correction^ fails. This 
gives the desired behavior on the case of general inputs 
with Ck > 'iBk', on success, the outputs are welU because 
they are leaving successful Corrections^. Assume now 
that the input blocks arc each wcllfe(6i, . . . , bk). 

If none of the input subblocks arc in relative error 
(probability at least 1 — 26^), we say the CNOTj, has 
failed if either of the CorrectionSfc fail, or if two or more 
CNOTsfe_i fail. The conditional failure rate is 

< (1 - Ck-ir2Bk + nCk-i2B'^ + ( 5 ) C^^,. 

If one of the input subblocks is in relative error (prob. 
< 2bk), we say the CNOT^. has failed if either of the 
Corrections^ fail, or if one or more CNOTsfe_i fail. The 
conditional failure rate is 

< {l-Ck-ir2B',^+nCk-i. 

If two input subblocks are in relative error (prob. < 
bl), we say the CNOT^ has failed. 

On success, the correct logical effect has been ap- 
plied (Def. 5). Indeed, the successful corrections have 
no logical effect, so the ideal decoding can be commuted 
past them. All but possibly one of the CNOTsfc_i have 
the correct logical effect, so decoding up to level fc — 1 
can be commuted past them, too, leaving ideal trans- 
verse CNOT gates. Transverse CNOT gates are an en- 
coded/logical CNOT gate for the Steane code (Sec. II A). 
The final decoding round can tolerate one subblock in er- 
ror from each block so it also commutes forward, leaving 
a single ideal CNOT gate. Overall, the probability of 
failure is at most 

Ck = {2Bk + 2nCk-iBi + ( 5 ) C|_i) 

+ 2bk{2B',,+nCk-i) + bl. □ 

Proof of Claim Bk. Correction^, shown in Fig. 3, begins 
with transverse Corrections/j_i. Then correct for X rel- 
ative errors using a Z syndrome extraction (explained 
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FIG. 3: A concatenation-level-fc encoded CNOT, or CNOT^, acting on two blocks each with n subblocks (n* qubits in aU), 
is implemented as transverse CNOTsfc_i (i.e., CNOTsfc_i from subblock i of the first input block to subblock i of the second 
input block, i — 1, . . . ,n) followed by level-fc error corrections on each block. Error correctioufc is implemented as transverse 
error corrections^^! , followed by transverse CNOTsfc_i to and from prepared ancillasfe. These ancillas are measured, and an 
appropriate correction (a level-(fc — 1) encoded operation) is possibly applied. Then the whole process is repeated. 



below), and symmetrically correct for Z relative errors. 
Then repeat: Apply transverse CorrectionSfe_i and cor- 
rect for X and Z relative errors again. 

To extract a Z syndrome, used for correcting X (bit 
flip) relative errors, apply transverse CNOTsfc_i into a 
ancilla [15]. (Logically, there should be no effect, 
since |+) = 75 (|0) + |1)) is an eigenstate of the NOT/X 
operation. But as in Fig. 2, X errors will be copied into 
the ancilla.) Measure this ancilla transversely in the 0/1 
computational basis (Z eigenbasis) and bottom-up recur- 
sively decode in order to determine which ancilla sub- 
block, if any, is in X relative error. Apply X gates trans- 
versely to this subblock as a correction [26]. 

We give only a combinatorially loose analysis; some 
optimizations are described in Sec. IV. There arc three 
cases, depending on the input. 

Always declare failure if any of the four ancilla prepa- 
rations fails. 

1. If the input is already wellfc(6i, . . . , bk), declare fail- 
ure if there are two level fc — 1 failures. Overall, there 
arc 2n Corrections^-i, An CNOTs^-i, and four prepared 
ancillasfe, so the probability of failure is at most 

B, ^ 4Afc + ( t ) Cl_, + (i)al + {^i) Bl_, 
+ fofc(4nCfe_i -I- 4afe + 2nBk-i) 
+ AnCk-ii-iak + 2nBk-i) + 8nakBk-i. 

We can conservatively set the block output wellness pa- 
rameter bk to be the probability of any single level fc — 1 
failure occurring in the correction circuit - not in the 
input: 

bk>bk = (4afc + 2nBk-i + AnCk-i)/{l - Bk). 
Relative errors in the input will be corrected and won't 



show up as relative errors in the output (assuming suc- 
cess). 

By combining commutative diagrams from Def. 5, 
clearly on success the output is well and no logical oper- 
ation has been applied. 

2. In the case that all but one of the input subblocks 
are wellfe(6i, . . . , 6fc_i) and not in relative error, again 
the output will be well with probability at least 1 — Bk- 
However, a single level fc — 1 failure can change the state 
of the block, since the code has only distance three. We 
introduce the failure parameter B'f, = 4:Ak + {I — Bk)bk to 
bound the probability that the state changes. (This is too 
conservative since only a single one of the early level fc — 1 
operations can change the state; later on, two level fc — 1 
failures arc still required.) Set bk = (1 — Bk)bk/{1 — B^). 
In the parameter range of interest, bk > bk- 

3. Consider now the most interesting case, which has 
dictated our implementation of Correction^: when the 
input is uncontrolled. (In either of the two previous cases, 
one round of transverse CorrectionSfe_i followed by X 
and Z correction would have sufficed.) In this case, we 
can make no guarantees about the logical effect of the 
correction, but want the output to be well. 

Here is an example which shows why this arbitrary 
input case is more complicated: Say that exactly all in- 
put subblocks are well, with exactly one in error. In this 
case, one level fc — 1 failure in the first X and Z correction 
rounds can change the whole block's state. This is okay; 
we are making no guarantees about the output state ver- 
sus the input state. The problem is that conditioning on 
the block's state having changed (as in the definition of 
well) , there is a zeroth-order probability of a subblock in 
relative error. Thus the output after the first rounds of 
X and Z correction is not necessarily well. 
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If there are no level k — 1 failures in the first X and 
Z correction rounds, then the block halfway through will 
already have all its subblocks well and none in relative 
error. The second X and Z correction rounds will leave 
the block well. 

If there is a level fc — 1 failure in the first X and Z 
correction rounds, then leaving these rounds there can be 
one subblocks which is not well and one subblock which 
is in relative error, as in the example. However, no level 
k ~ 1 failures are then allowed in the second X and Z 
correction rounds. The transverse CorrectionSfc_i will 
fix the subblock (if any) which is not well. The X and Z 
correction rounds will fix the subblock in relative error. 

□ 

Ancilla preparation is a key step in a fault-tolerance 
scheme. (This is particularly true for schemes based on 
teleportation, like those of Knill [16, 18], and in our own 
method for achieving universality in Sect. V.) Only dur- 
ing ancilla preparation do subblocks within a particular 
block interact with each other. The preparation scheme 
must dampen these interactions to avoid strong correla- 
tions between errors in different subblocks. The proof of 
Claim Afe runs along similar lines as the above proofs, 
and is given in Appendix A. The failure parameters de- 
termined there are 

Ak = +2n{2s{n) + n){Ak-i+Bk-i)Ck^i\ /Nk 
\ +An^Ak-iBk-J 
2n{Ak-i + Bk-i) + (2.s(n) + n)Ck-i 
il-A,)Nu 

where we may take s(n = 7) = 9 and 

Nk=l- 2nAk-i - 2nBk-i - (2s(n) 4- ?i)Ck-i. 

IV. ESTIMATES OF THE CONSTANT 
THRESHOLD 

The results in Sec. Ill give the claimed constant thresh- 
old for stabilizer operations, because the error parameters 
each drop quadratically at each level of concatenation. 
This is straightforward to prove rigorously by replacing 
complex inequalities with simpler upper bounds. 

Our goal was to complete a rigorous proof of a constant 
threshold, not to estimate the true threshold. Still, it is 
interesting how high a threshold these techniques give us. 
We have numerically iterated the equations of Sec. Ill, 
taking n = 7. We found that the error rates converged 
to zero for p = Cq < 6.75 x 10~^. Of course this is not 
a proof that the equations converge in this range, but 
the proof is doable: simply bound arithmetic errors while 
iterating the equations numerically up to a level at which 
weaker bounds can be applied to show convergence. 

This threshold does not compare directly to the 4.18 x 
10~^ rigorous threshold lower bound of Alifcris et al. [17] 



because their error model allows faults in singlc-qubit 
preparation and measurement as well as just CNOT 
gates. This means that preparing an unverified ancilla, 
for example, has 16 possible fault locations instead of 
just 9. We expect that incorporating these single-qubit 
errors into our model would reduce the threshold by a 
small constant. Also, our recursion equations were highly 
conservative, typically bounding the probability of a level 
k failure by the probability of any two level fc — 1 fail- 
ures. Aliferis et al., however, used a computer to count 
exactly which pairs of level fc — 1 faults could cause a 
level fc failure. This counting obviously directly improves 
their threshold estimate, but it improves the threshold 
estimate in an indirect way, too; it allows them to use 
more easily less modular constructions. 

There are a number of optimizations that can be car- 
ried out to improve the threshold slightly. We have briefly 
investigated improved ancilla preparation, improved im- 
plementation of CNOTfc, and improved analysis tech- 
niques. 

For example, in Claim Afc, we apply transverse 
Correctionfc_i at the end of preparing an unverified an- 
cilla. But each CNOTfc_i ends with CorrectionSfc_i any- 
way, so this is redundant and useful only for a more mod- 
ular analysis. Ancilla preparation can also be improved 
by verifying against both X and Z errors. 

Actually, though, with CNOTfc_i notice that if the in- 
put blocks are well, then full, two-round Correctioufc-i is 
not needed on each output block - instead, a single round, 
using two ancillas instead of four, suffices. In unveri- 
fied ancilla preparation with this version of CNOTfc_i a 
transverse, two-round Corrcctionfe_i would be needed for 
the proof to go through, not just for modularity. Thus if 
we change our CNOT operations to be followed by a one- 
round correction, then during unverified ancilla prepara- 
tion, seven full corrections and eighteen one-round cor- 
rections total will be applied (or, sacrificing some mod- 
ularity, seven full and eleven one- round corrections), as 
compared to eighteen full corrections currently. This in- 
creases the threshold to 1.46 x 10~^, according to numer- 
ical iterations of the recursion equations. 

We might also split out two different kinds of CNOT/; 
gates, because if one of the output blocks of a CNOT 
gate is to be measured immediately (as is the case for 
the transverse CNOT gates in preparation), then error 
correction of that block can only introduce more errors. 
Another optimization is to switch to error-correction via 
teleportation [18]. A factor of two or three improvement 
in the proven threshold seems to be in reach using these 
kinds of optimizations - which are easy to write on the 
back of an envelope, but tedious to write out formally. 

The bottlenecks in our equations in fact appear to be 
ancilla preparation subblock failure Ok and CNOT failure 
Ck- It should be productive to improve the analysis of 
ancilla preparation - by for example verifying against Z 
as well as X errors, and by using computer counting to 
minimize the overhead from artificial modularity. Also, 
during the counting, one should try to distinguish X and 
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Z errors as much as possible - this might be simplest 
with a more specialized base error model. Once the error 
model is nailed down, one can actually simulate ancilla 
preparation at the lower few levels and determine higher 
estimates for ancilla failure rates just at these levels. This 
allows for analyzing more complicated ancilla prepara- 
tion schemes, including schemes based more heavily on 
postselection [11]. Assuming (or proving) these estimates 
to be trustworthy can increase the threshold significantly, 
even though only a few levels (e.g., Ai, oi, 02) are di- 
rectly modified. 

These threshold estimates for stabilizer operations re- 
main applicable for full quantum universality, discussed 
in Sec. V. The techniques we use to gain universality 
can tolerate higher error rates than those for stabilizer 
operations, so achieving universality is not a bottleneck. 



V. FAULT-TOLERANT UNIVERSALITY 

To achieve universality, we use the technique of "magic 
states distillation" [16, 20, 21]. It is well known that sta- 
bilizer operations plus the ability to repeatedly prepare 
one of the "magic," single-qubit pure states \H) or |T) 
gives quantum universality. (More generally, in fact, sta- 
bilizer operations plus the ability to repeatedly prepare 
any single-qubit pure state which is not a Pauli eigenstate 
gives quantum universality.) Magic states distillation is 
a technique for using perfect stabilizer operations to dis- 
till faulty copies of \H) or \T) to perfect copies (or at 
least, arbitrarily close to perfect with only poly logarith- 
mic overhead). 

Operating beneath the threshold for stabilizer opera- 
tions, we can assume the error rate is arbitrarily small, 
and therefore condition on no stabilizer operation errors 
at all. So we have perfect stabilizer operations. To apply 
the technique of magic states distillation to achieve uni- 
versality, following Knill [16, 22], we create an encoded 
Bell pair. Then wc decode one half from the bottom 
up. (Unlike Knill, we do not postsclcct on no detected 
errors.) 

At this point, we have a Bell pair, half of which is 
a single qubit unprotected from errors, and the other 
half is encoded. Teleport a single-qubit (approximate) 
"magic state" \H) or \T) into the encoding, using a phys- 
ical CNOT gate and two single-qubit measurements. Of 
course, there can be errors in the teleportation. But any 
errors will also be teleported, into logical errors on the 
remaining encoded \H) or |T). Encoded stabilizer op- 
erations can then distill these into perfect logical magic 
states (meaning no logical errors, bit errors of course re- 
main), from which we obtain encoded universality. 

To prove that this scheme works, we need to under- 
stand the behavior of the recursive, bottom-up decoding 
procedure. It is fairly straightforward that this works, 
maintaining a bounded error rate; a simple proof is given 
below. We also need to verify that the magic states dis- 
tillation procedures apply in our setting. 



A. Decoding 

Lemma Di (Decodingi). Consider a wellifpi) blocki. 
On each bit independently, apply a Pauli error with prob- 
ability at most qi . Decoding of the block can still be done 
successfully - meaning the state of the system is the same 
as if perfect decoding had been applied - with probability 
at least 1 — Di{pi, qi). The rest of the time, the system's 
state is the same as if either I, X, Y or Z had been applied 
after perfect decoding. 

Proof. If any CNOT gate fails, then an error can be in- 
troduced (there is no fault tolerance). Additionally, an 
error will be introduced if there are two or more errors 
in the input after the bit errors are applied, which occurs 
with probability at most npiqi + (S)??- Letting e(n) 
bound the number of CNOT gates required for encod- 
ing/decoding an 71-qubit code (e(n) = 0(n^) but is pos- 
sibly > s{n) defined in the proof of Claim A^ as the num- 
ber of CNOTs required for encoding a stabilizer state), 
we may take 

Di{pi,qi) EE e{n)CQ+npiqi + (2)9?- 

For the Steanc code, set e(7) = 11 (Fig. 1). □ 

Claim Dfc (Decodingfe). Decoding of a 
wellk (bi, . . . ,bk) blockk can be done successfully - 
meaning the state of the system is the same as if perfect 
decoding had been applied - with probability at least 
1 — Dk . On failure, the system 's state is the same as if 
either I, X, Y or Z is applied after perfect decoding. 

D, = e{n)Co 
Dk = e(n)Co + ubkB^^i + ( S ) DI_^. 

Proof. First apply Lemma Di withpi = bi and qi = 0, in 
order to decode the bottom layer of a well2(6i, 62) block2. 
This outputs a welli(62) blocki, except each bit has an 
additional failure probability of at most Di. Thus the 
claim can be applied again with pi — 62 and qi = Di 
to get a single bit out which will be correct except with 
a failure probability which we bound by D2. Continue 
inductively. □ 

Clearly if p = Co is beneath the threshold for stabilizer 
operations (so bk converges to zero quadratically at each 
level), then too there is a threshold for p beneath which 
converges to a constant. In our numerical thresh- 
old estimates, the threshold for Dk is equal to that for 
stabilizer operations. 

B. Magic states distillation 

Applying the magic states distillation results of Ref. 's 
[20, 21] requires a little bit of work. The main concern 
here is that these results were derived under the assump- 
tion that multiple identical copies of p the input faulty 
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state can be prepared at will. However, in our setting we 
can't assume that different encoded approximate magic 
states have exactly the same logical error rates. In fact, 
at least for the |T) state distillation scheme, the bound on 
the allowed error rate is the same for nonidentical states 
as for identical states. (Numerical experiments indicate 
that the same is probably true for \H) state distillation 
schemes.) 

The x,y,z Bloch sphere coordinates of a single-qubit 
state p are ^Tt{X.p), ^Ti{Y.p), ^Ti{Z.p), respectively. 
Define |T) by coordinates x ^ y ^ z ^ l/VS of \T){T\. 
Let T = (e^'^'/s - l)|r)(T| + /. Before beginning distilla- 
tion, symmetrize each state by applying /, T or with 
equal probabilities 1/3 - so we may assume each state 
has equal x, y, z coordinates. 

Lemma 1. Perfect adaptive stabilizer operations to- 
gether with the ability to prepare states pi, i = 1,2, . . 
with Bloch sphere coordinates Xi ~ yi ^ Zi ~ fi/^/S, such 
that fi > \/3/7 + e, e > 0, gives quantum universality. 
Here e > 0, a constant, is known, but the fi may not be. 

A proof is given in Appendix B. 

VI. CONCLUSION 

We have proved: 

Theorem 1. For the error model specified in Def. 1, ar- 
bitrarily accurate, efficient, universal quantum computa- 
tion is achievable, via a scheme based on concatenation 
of the |7, 1,3] Steane code, as long as the error rate is 
beneath a positive constant threshold. 

Note that repeated syndrome extraction during error 
correction is not necessary as long as the ancillas used 
in error corrections are prepared fault-tolerantly (we use 
a simple postselection scheme, but repeated syndrome 
extraction to verify ancillas would also have sufficed). It 
should also be emphasized that preparation of reliable 
encoded states is a major threshold bottleneck. 

A major open question in quantum fault-tolerance lies 
in rigorously proving higher thresholds. As the current 
highest threshold estimates rely heavily on postselection, 
a good understanding of the probability distribution of 
errors seems to be necessary to prove results about these 
schemes. Our arguments comprise a first, minor step 
in this direction, but to go further one needs to char- 
acterize the probability distribution of errors within the 
blocks which are in relative error. One also needs to be 
concerned about any dependencies between blocks - for 
even in very artificial models full independence is impos- 
sible to maintain (or even converge to asymptotically), 
and small deviations can rapidly build up into large de- 
pendencies when postselections discard large fractions of 
the probability mass. Perhaps combining a probabilistic 
analysis with the overlapping steps method of Aliferis et 
al. could lessen the strict analysis requirements which 



make a rigorous proof in a postselected setting so diffi- 
cult. 

There are many other open problems, of course. How 
high can the provable threshold be pushed for schemes 
not based on postselection? We are still several orders 
of magnitude below some of the higher estimates. One 
promising way to approach this problem is with a rig- 
orous, but computer-aided, analysis of the lower levels 
of the fault-tolerance scheme, then "plugging in" to a 
conservative, analytical estimate once the error rate has 
dropped sufficiently. Running Monte Carlo simulations, 
then fitting the simulation results to our failure and well- 
ness parameters, is another method for understanding er- 
ror behavior - although then we would obtain only con- 
fidence intervals for the threshold. (Since the threshold 
bottleneck is in achieving fault-tolerant stabilizer opera- 
tions, which are classically simulatable, simulations are 
efficient.) 

Can this threshold proof be extended to more gen- 
eral error models (not just probabilistic Pauli errors)? A 
real quantum computer is likely to use lower-level error- 
correction techniques highly specialized to the physi- 
cal error model ~ for example composite pulses [23] 
- before switching to a more general fault-tolerance 
scheme. It is important to better understand the in- 
terface/transition between the low- and high-level error 
models/error-correction schemes. 

Efficiency is a major practical concern; even con- 
stants are very important. All constant-threshold fault- 
tolerance schemes should ultimately have the same effi- 
ciency in big-O notation - one can just switch to the most 
efficient scheme once a less efficient scheme has reduced 
the error rate sufficiently (with constant overhead). For 
practical quantum computing, a threshold result is not 
needed - only an argument that the effective logical error 
rate, in some sense, can be pushed low enough to do in- 
teresting computations. Dropping the requirement that 
the error model definition concatenates nicely might gain 
us some freedom. Steane has investigated concatenating 
on larger codes once a smaller code like the |7, 1, 3] code 
has reduced the error rate sufficiently, and found it to be 
an apparently useful technique [15]. 

Is there a more efficient method of obtaining univer- 
sality for our scheme? Can one distill multi-qubit states 
which form a more natural universal gate set? What can 
be done with a non-adaptive circuit structure? 

The present work merely sheds light on a fact which 
had long been assumed - a threshold for a distance three 
code - but not proved. Hopefully a more solid foundation 
will help us as we try to address the more ambitious open 
questions in this field. 
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APPENDIX A: PROOF OF CLAIM Afc 

Proof of Claim . To prepare the ancilla/j . we first pre- 
pare n = 7 ancillasfc_i either |0)a;-i or |+)fe-i, then 
apply s(ji) CNOTsfc_i to entangle them, then apply 
Correctionfc_i transversely. For the Steane code, s(7) = 
9 (Fig. 1) - entangling a general n-qubit stabilizer state 
requires at most s[n) = 0[r?) CNOTs. (The ancilla 
can be prepared by transversely applying [by as- 
sumption perfect] transverse Hadamard gates to |0)fe_i, 
or can be prepared directly by symmetry.) 

Run the above procedure twice to create two unveri- 
fied ancillas jO)*;, and apply CNOTfc_i transversely from 
the first copy to the second. Then measure the second 
half in the Z eigenbasis (computational basis), postse- 
lecting on no X errorSfe_i. That is, we measure all the 
physical qubits in the second half, then in a classical com- 
puter recursively decode up to the statesfc_i. If there are 
any relative errorSfe_i or if the statcfc is not I, then dis- 
card the ancilla qubits and start over. (Discarding qubits 
based on measurement outcomes requires adaptive con- 
trol. Ancilla preparation can be implemented without 
adaptive control by verifying against two copies instead 
of one. However, our method of obtaining universality in 
Sec. V will require adaptive control anyway.) 

The probability of acceptance (normalization con- 
stant) is at least 

ATfe = 1 - 2nylfe_i - 2nBfc_i - (2s(n) + ?7,)Cfc_i, 

or one minus a unon bound on the probability of a level 
k—\ failure during the two unverified ancilla preparations 
and the transverse CorrectionSfc_i and CNOTsfe_i. In 
our parameter ranges, this will always be a constant, so 
the overhead from postselection is acceptable. 

We declare failure, conservatively, if there are two level 
k — \ failures and the ancilla is still accepted. The prob- 
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ability of failure is at most 



v 



-2n(2s(n) + n){Ak-i + Bk-i)Ck-i \ /Nk. 

^Ar? Ak-^iBk-ij 



If there are no level fc — 1 failures, then there are no 
errorSfe_i in the output ancilla, as can easily be verified 
by using the commutative diagram of Definition 5 for 
each CNOTfe_i. 

If there is one level fc — 1 failure and the ancilla is still 
accepted, then the statefc must be I, and there can only 
be one subblock in relative error. Indeed, if the single 
failure is among the transverse CorrectionSfc_i or trans- 
verse CNOTsfc-i, then in the output blocks, only one 
subblockfc_i is affected. If the failure is in one of the 
earlier CNOTsfc_i or in the preparation of ancillas/j-i, 
then all output subblocks/c-i will still be well (because 
the transverse CNOTs succeeded), and also the unver- 
ified ancillas will both have well subblocksfc_i (because 
the CorrectionSfc-i succeeded). It is then possible that 
there are X or Y relative errorSfe_i, or that the error has 
spread so there is an X or Y error in the state?; of the un- 
verified ancilla. In either case, the error will be detected 
and the ancilla discarded. 

On success, the statefc is either I or Z. Since we are 
preparing |0)fc; logical Z (transverse physical Z) has no 
physical effect. Therefore, if the statCfc is Z, we can as a 
bookkeeping reduction add Z errors to every bit to reduce 
the state to I. For this reason, we also do not need to 
check for Z relative errors. (Checking for Z relative errors 
might be a useful optimization ~ see See. IV - but it is 
very intuitive anyway that X errors are worse than Z 
errors in |0)fc ancillas prepared for X error correction as 
in Claim Bfc [15, 24].) 

Therefore, on success the output is 
wellfc(6i, . . . , 6fc_i, afc), with 



ak 



2n{Ak-i + Bk-i) + (2.s(n) + n)Ck-i 
(1 - Ak)Nk 



□ 



APPENDIX B: 5-QUBIT CODE DISTILLATION 

Lemma 1. Perfect adaptive stabilizer operations to- 
gether with the ability to prepare states pi, i = 1,2, . . 



with Block sphere coordinates Xi = yi = Zi = fi/ -\/3, such 
that fi > \/3/7 -1- e, e > 0, gives quantum universality. 
Here e > 0, a constant, is known, but the fi may not be. 

Proof. Take five of the pi and decode the five-qubit code 
stabilized by XZZXI and its cyclic permutations. Post- 
select on no detected errors. Bravyi and Kitaev [20] show 
that when the input fi are all equal to /. the output has 
coordinates x ^ y ~ z ~ — /out/v^, with /out > /• Re- 
peating the postselected decoding procedure efficiently 
moves the output towards a "magic" state for which 
a simp le a rgument gives universality. A lower bound on 
/ — \/3/7 is required in order to know how many dis- 
tillation iterations are required, but / itself need not be 
known. 

It remains to show that distillation also succeeds when 
the fi are not all equal, but only fi > f > \/3/7 + e. 
Simple algebra gives that dfont/dfi > 0, so improving 
any of the input fidelities can only improve the output 
fidelity. The probability of decoding acceptance is also 
monotone, so distillation remains efficient. 

Indeed, as a function of A, . . . , /s, the output fidelity 
conditioned on decoding acceptance is 

r _ (./1/2/3 + ■■■)- 2/1/2/3/4/5 

3+(/i/2/3/4 + ---) ' 

where ellipses indicate symmetrical terms (in the numer- 
ator, nine products of three fi] in the denominator, four 
products of four fi). Differentiate with respect to f^ - 
other derivatives are related by symmetry. Use the quo- 
tient rule d{a/b) — -^{bda — adb). The denominator is 
clearly at least nine for fi > 0. As notation, write /^ for 
fifj and similarly fijk and fijki for three- or four- wise 
products. The numerator, which does not involve /s, is 

3(/l2 + •••)- if 123 + ■■■)- {fl2hi + ■■■)- 6/1234 - 2/^234 
= /l2 (3 - /1234 — /34 - I/12/34 ~ \fl2{fl + /I)) + ■ ■ • ■ 

Each term is nonnegative when the fi € [0, 1], implying 
that /out is monotone in each fi separately. 

The probability of decoding acceptance, ^(/i/2/3/4 + 
• • • ), is also monotone. □ 



